Generalizing and Hybridizing Count-based and Neural Language Models
نویسندگان
چکیده
Language models (LMs) are statistical models that calculate probabilities over sequences of words or other discrete symbols. Currently two major paradigms for language modeling exist: count-based n-gram models, which have advantages of scalability and test-time speed, and neural LMs, which often achieve superior modeling performance. We demonstrate how both varieties of models can be unified in a single modeling framework that defines a set of probability distributions over the vocabulary of words, and then dynamically calculates mixture weights over these distributions. This formulation allows us to create novel hybrid models that combine the desirable features of count-based and neural LMs, and experiments demonstrate the advantages of these approaches.1
منابع مشابه
P/E Modeling and Prediction of Firms Listed on the Tehran Stock Exchange; a New Approach to Harmony Search Algorithm and Neural Network Hybridization
Investors and other contributors to stock exchange need a variety of tools, measures, and information in order to make decisions. One of the most common tools and criteria of decision makers is price-to earnings per share ratio. As a result, investors are in pursuit of ways to have a better assessment and forecast of price and dividends and get the highest returns on their investment. Previous ...
متن کاملMirror Neurons and (Inter)subjectivity: Typological Evidence from East Asian Languages
Language is primarily constituted by action and interaction based on sensorimotor information. This paper demonstrates the nature of subjectivity and intersubjectivity through the neural mechanism and typological evidence of sentence-final particles from East Asian languages and extends to the discussion of the relationship between them. I propose that intersubjecivity is a kind of embedded or ...
متن کاملEfficient Method Based on Combination of Deep Learning Models for Sentiment Analysis of Text
People's opinions about a specific concept are considered as one of the most important textual data that are available on the web. However, finding and monitoring web pages containing these comments and extracting valuable information from them is very difficult. In this regard, developing automatic sentiment analysis systems that can extract opinions and express their intellectual process has ...
متن کاملFitting of Count Time Series Models on the Number of Patients Referred to Addiction Treatment Centers in Semnan County
Abstract. Count data over time are observed in many application areas. Many researchers use time series patterns to analyze this data. In this paper, the poisson count time series linear models and negative binomials on this type of data with the explanatory variables are studied. The Likelihood analysis and the evaluation of count time series model based on generalized linear models are pres...
متن کاملDaily Pan Evaporation Estimation Using Artificial Neural Network-based Models
Accurate estimation of evaporation is important for design, planning and operation of water systems. In arid zones where water resources are scarce, the estimation of this loss becomes more interesting in the planning and management of irrigation practices. This paper investigates the ability of artificial neural networks (ANNs) technique to improve the accuracy of daily evaporation estimation....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016